Processing Information Graphics in Multimodal Documents
Abstract
Information graphics, such as bar charts, grouped bar charts, and line graphs, are an important component of multimodal documents and cannot be ignored. When such graphics appear in popular media, such as magazines and newspapers, they generally have an intended message. We argue that this message represents a brief summary of the graphic’s high-level content, and thus can serve as the basis for more robust information extraction from multimodal documents. The paper describes our methodology for automatically recognizing the intended message of an information graphic, with a focus on grouped bar charts.
Similar resources
Directional Stroke Width Transform to Separate Text and Graphics in City Maps
City maps are among the most complex real-world documents. In these maps, text labels overlap with graphics and appear in a variety of fonts and styles in different orientations. Usually, text and graphic colours are not predefined because map publishers differ. In most city maps, text and graphic lines form a single connected component. Moreover, the common regions of text and graphic lin...
Tracing Integration of Text and Pictures in Newspaper Reading
Newspapers and net papers are complex multimodal documents consisting of texts, pictures and graphics. Although we encounter such documents in our everyday life, there is still little empirical evidence about how these formats are processed. The question is how readers interact with these formats, combine information from all of the available sources and create coherence. In a naturalistic news...
Toward Extractive Summarization of Multimodal Documents
Summarization research has focused on text, and relatively little attention has been given to the summarization of multimodal documents. If extractive summarization techniques are to be used on multimodal documents containing information graphics (bar charts, line graphs, etc.), then a strategy must be devised both for extracting the high-level content of the information graphics and for identi...
Semantic Modeling of Multimodal Documents for Abstractive Summarization
We describe a method for semantic modeling of multimodal documents and discuss how this can be used to generate an abstractive summary. Information extracted from the text by a semantic parser and from the graphics by a graph understanding system is combined into a single knowledge base. By operating at the semantic (rather than the surface) level, we are able to integrate information collected...
Semi-automated annotation of page-based documents within the Genre and Multimodality framework
This paper describes ongoing work on a tool developed for annotating document images for their multimodal features and compiling this information into a corpus. The tool leverages open source computer vision and natural language processing libraries to describe the content and structure of multimodal documents and to generate multiple layers of XML annotation. The paper introduces the annotatio...
Publication date: 2008